OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles

Internal identifier: 000908 (Main/Exploration); previous: 000907; next: 000909

Authors: Mehdi Haji [Canada]; D. Bui [Canada]; Y. Suen [Canada]

RBID : ISTEX:DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3

Abstract

Document images obtained from scanners or photocopiers usually have a black margin that interferes with the subsequent stages of page segmentation algorithms. The margins must therefore be removed at the initial stage of a document processing application. This paper presents an algorithm which we have developed for document margin removal, based on the detection of document corners from projection profiles. The algorithm makes no restrictive assumptions about the input document image: it needs neither all four margins to be present nor the corners to be right angles. For tilted documents, it is able to detect and correct the skew. In our experiments, the algorithm was successfully applied to all document images in our databases of French and Arabic documents, which contain more than two hundred images with different types of layouts, noise, and intensity levels.
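
The abstract relies on projection profiles, that is, per-row and per-column counts of dark pixels. The following is a minimal Python sketch of that general idea only. It assumes a binarized image stored as a list of rows (1 = dark, 0 = light) and an untilted page. The function names and the fixed `ratio` threshold are hypothetical, and the threshold is a simple stand-in for the corner detection described in the paper, which this record does not detail. Skew correction is not covered.

```python
def projection_profiles(image):
    """Return (row_profile, col_profile): dark-pixel counts per row and per column.

    `image` is a list of equal-length rows; 1 marks a dark pixel, 0 a light one.
    """
    row_profile = [sum(row) for row in image]
    col_profile = [sum(col) for col in zip(*image)]
    return row_profile, col_profile


def page_extent(profile, line_length, ratio=0.9):
    """Return (start, end) after skipping nearly all-dark lines at both ends.

    A line is treated as margin when at least `ratio` of its pixels are dark.
    This fixed threshold is an illustrative assumption, not the paper's method.
    """
    limit = ratio * line_length
    start, end = 0, len(profile)
    while start < end and profile[start] >= limit:
        start += 1
    while end > start and profile[end - 1] >= limit:
        end -= 1
    return start, end


def crop_margins(image, ratio=0.9):
    """Crop dark margins from an untilted binary image using its profiles."""
    row_profile, col_profile = projection_profiles(image)
    top, bottom = page_extent(row_profile, len(image[0]), ratio)
    left, right = page_extent(col_profile, len(image), ratio)
    return [row[left:right] for row in image[top:bottom]]


# Toy page with a dark top margin (row 0) and a dark left margin (column 0).
page = [
    [1, 1, 1, 1, 1, 1],
    [1, 0, 0, 0, 0, 0],
    [1, 0, 1, 1, 0, 0],
    [1, 0, 0, 0, 0, 0],
    [1, 0, 0, 0, 0, 0],
]
cropped = crop_margins(page)
```

On this toy input, the top row and left column are dropped while the two ink pixels inside the page are kept. A real scan would need binarization first, and a tilted page would defeat the fixed threshold, which is the case the paper's corner-based approach is meant to handle.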

DOI: 10.1007/978-3-642-04146-4_109


The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles</title>
<author>
<name sortKey="Haji, Mehdi" sort="Haji, Mehdi" uniqKey="Haji M" first="Mehdi" last="Haji">Mehdi Haji</name>
</author>
<author>
<name sortKey="Bui, D" sort="Bui, D" uniqKey="Bui D" first="D." last="Bui">D. Bui</name>
</author>
<author>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/978-3-642-04146-4_109</idno>
<idno type="url">https://api.istex.fr/document/DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000542</idno>
<idno type="wicri:Area/Istex/Curation">000535</idno>
<idno type="wicri:Area/Istex/Checkpoint">000430</idno>
<idno type="wicri:doubleKey">0302-9743:2009:Haji M:simultaneous:document:margin</idno>
<idno type="wicri:Area/Main/Merge">000916</idno>
<idno type="wicri:Area/Main/Curation">000908</idno>
<idno type="wicri:Area/Main/Exploration">000908</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles</title>
<author>
<name sortKey="Haji, Mehdi" sort="Haji, Mehdi" uniqKey="Haji M" first="Mehdi" last="Haji">Mehdi Haji</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Centre for Pattern Recognition and Machine Intelligence, Concordia University, 1455 de Maisonneuve Blvd. West, H3G 1M8, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author>
<name sortKey="Bui, D" sort="Bui, D" uniqKey="Bui D" first="D." last="Bui">D. Bui</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Centre for Pattern Recognition and Machine Intelligence, Concordia University, 1455 de Maisonneuve Blvd. West, H3G 1M8, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Centre for Pattern Recognition and Machine Intelligence, Concordia University, 1455 de Maisonneuve Blvd. West, H3G 1M8, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2009</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3</idno>
<idno type="DOI">10.1007/978-3-642-04146-4_109</idno>
<idno type="ChapterID">109</idno>
<idno type="ChapterID">Chap109</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Document images obtained from scanners or photocopiers usually have a black margin which interferes with subsequent stages of page segmentation algorithms. Thus, the margins must be removed at the initial stage of a document processing application. This paper presents an algorithm which we have developed for document margin removal based upon the detection of document corners from projection profiles. The algorithm does not make any restrictive assumptions regarding the input document image to be processed. It neither needs all four margins to be present nor needs the corners to be right angles. In the case of the tilted documents, it is able to detect and correct the skew. In our experiments, the algorithm was successfully applied to all document images in our databases of French and Arabic document images which contain more than two hundred images with different types of layouts, noise, and intensity levels.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Canada</li>
</country>
</list>
<tree>
<country name="Canada">
<noRegion>
<name sortKey="Haji, Mehdi" sort="Haji, Mehdi" uniqKey="Haji M" first="Mehdi" last="Haji">Mehdi Haji</name>
</noRegion>
<name sortKey="Bui, D" sort="Bui, D" uniqKey="Bui D" first="D." last="Bui">D. Bui</name>
<name sortKey="Bui, D" sort="Bui, D" uniqKey="Bui D" first="D." last="Bui">D. Bui</name>
<name sortKey="Haji, Mehdi" sort="Haji, Mehdi" uniqKey="Haji M" first="Mehdi" last="Haji">Mehdi Haji</name>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000908 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000908 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3
   |texte=   Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024